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Abstract 

Let Ffc be the free group on k generators. A word w E is called primitive 
if it belongs to some basis of F^. We investigate two criteria for primitivity, 
and consider more generally, subgroups of F& which are free factors. 

The first criterion is graph-theoretic and uses Stallings core graphs: given 
subgroups of finite rank H < J < F& we present a simple procedure to deter- 
mine whether H is a free factor of J. This yields, in particular, a procedure to 
determine whether a given element in F^ is primitive. 

Again let w € F& and consider the word map w : G x . . . x G — > G 
(from the direct product of k copies of G to G), where G is an arbitrary finite 
group. We call w measure preserving if given uniform measure on G x . . . x G, 
w induces uniform measure on G (for every finite G). This is the second 
criterion we investigate: it is not hard to see that primitivity implies measure 
preservation and it was conjectured that the two properties are equivalent. 
Our combinatorial approach to primitivity allows us to make progress on this 
problem and in particular prove the conjecture for k = 2. 

It was asked whether the primitive elements of F& form a closed set in the 
profinite topology of free groups. Our results provide a positive answer for F2. 
Keywords: word maps, primitive elements of free groups, primitivity rank 



1 Introduction 

An element w of a free group J is called primitive if it belongs to some basis (free 
generating set) of J. When J is given with a basis X, this is equivalent to the 
existence of an automorphism of J which sends w to a given element of X. 

* Supported by Advanced ERC Grant 247034 of Aner Shalev, and by Adams Fellowship Program 
of the Israel Academy of Sciences and Humanities. 
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The notion of primitivity has a natural extension to subgroups in the form of 
free factors. Let H be a subgroup of the free group J (in particular, H is free as 



well). We say that H is a free factor of J and denote H < J, if there is another 



subgroup H' < J such that H * H' = J. Equivalently, H < J ii every basis of if 
can be extended to a basis of J. (This in turn is easily seen to be equivalent to the 
condition that some basis of H extends to a basis of J). 

Let Ffc be the free group on k generators with a fixed basis X = {xi, . . . , Xk}- 
We study finitely generated subgroups of Ffc (denoted H <f g Ffc) and relations 
among them using core graphs, also known as Stallings' graphs (See |Sta83j. Actually 
our definition is a bit different than Stalling's, see below). Associated with every 
subgroup H < Ffc is a directed, pointed, edge-labeled graph denoted Tx{H). Edges 
are labeled by the elements of the given basis X = {xi, . . . ,Xk} of Ffc. A full 
definition appears in Section [2j but we illustrate the concept in Figure 11.11 It shows 
the core-graph of the subgroup of F2 generated by X\X2X^ and X2X 2 . 



Core graphs are a key tool in the research of free groups, and are both used 
for proving new results and for introducing simple proofs to known results (see, 
for instance, |KM02t IMVW07] , for a survey of many such results and for further 
references) . 

A central new ingredient of our work is a new perspective on core graphs. There 
is a naturally defined notion of quotient on such graphs (see Section [3]). In particular, 
we introduce in Section [3] the notion of immediate quotients. This in turn yields a 
directed graph whose vertices are all core graphs of finitely generated subgroups of 
Ffc (w.r.t. the fixed basis X). A directed edge in this graph stands for the relation 
of an immediate quotient. This is a directed acyclic graph (DAG) i.e., it contains 
no directed cycles. As always, reachability in a DAG induces a distance function 
between vertices. Namely px(x,y) is the shortest length of a directed path from x 
to y. We mention that the transitive closure of the immediate quotient relation is 
the relation "being a quotient of" which is a partial order (a lattice, in fact) on all 
core graphs of f.g. subgroups of Ffc. The following theorem gives a simple criterion 
for free factorness in terms of this distance: 

Theorem 1.1. Let H,J <f g Ffc, and assume Fx{J) is a quotient ofTx{H). Then 
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Figure 1.1: The core graph Tx (H) where 
H = (x 1 X2 1 Xi,Xi 2 x 2 ) < F 2 . 



H < J if and only if 



p x {H, J) = rk(J)-rk{H) 
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We note that px{-, •) can be explicitly computed, and this theorem thus yields 
automatically an algorithm to determine, for two given H, J <f g whether if is a 
free factor of J. In particular, it can serve to detect primitive words (see Appendix 1X1). 
More generally, for any f.g. free groups H < J, this theorem can serve to detect the 
minimal number of complementary generators needed to obtain J from H (Corollary 

ESP- 
In fact, the core graph of every H <f g F& has finitely many quotients (or reachable 
vertices). This set is also known in the literature as the fringe of H (see, e.g. 
|MVW07] ) . For example, Figure [37T1 shows the fringe of the subgroup H — ([£1,0:2]). 
The difference in ranks between H and F 2 is 1. However, the distance between the 
corresponding core graphs in the fringe is 2. This proves that H is not a free factor 
of F 2 , or equivalently that [xi,^] is not primitive. We elaborate more in Appendix 

Remark 1.2. We stress that there are other graph-theoretic algorithms to detect 
free factors and primitive words, including simplifications of the seminal Whitehead 
algorithm (the algorithm first appeared in |Whi36a[ IWhi36b] , for its graph-theoretic 



simplifications see |Ger84|,ISta99] ). Our approach, however, is very different and does 
not rely on Whitehead automorphisms. We elaborate more on this in Appendix [A] 

Theorem 11.11 is also used for the other concept we study here, that of measure 
preservation of word maps. Associated with every w G F^ is a word map. We view 
w as a word in the letters of the basis X. For every group G, this mapping which 
we also denote by w maps G x G x ■ ■ ■ x (j — > G as follows: It maps the fc-tuple 

k 

(gi, . . . , gk) to the element w(gi, . . . , gk) G G, where w (g\, ... , gk) is the element 
obtained by replacing x\, . . . , Xk with g\, . . . , g^ (respectively) in the expression for 
w, and then evaluating this expression as a group element in G. 

During the last years there has been a great interest in word maps in groups, and 
extensive research was conducted (see, for instance, |Sha09] , |LS09] ; for a recent book 
on the topic see |Seg09|). Our focus here is on the property of measure preservation: 
We say that the word w preserves measure with respect to a finite group G if when 
/c-tuples of elements from G are sampled uniformly, the image of the word map w 
induces the uniform distribution on G. (In other words, all fibers of the word map 
have the same size). We say that w is measure preserving if it preserves measure 
with respect to every finite group G. 

This concept was investigated in several recent works. See for example [LS08J 
and [GS09J, where certain word maps are shown to be almost measure preserving, 
in the sense that the distribution induced by w on finite simple groups G tends to 
uniform, say, in L\ distance, when \G\ — > oo. 

Measure preservation can be equivalently defined as follows: fix some finite group 
G, and select a homomorphism «g G Hom(Fk, G) uniformly at random. A homo- 
morphism from a free group is uniquely determined by choosing the images of the 
elements of a basis, so that every homomorphism is chosen with probability l/|G| fe . 
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We then say that w G is measure preserving if for every finite group G and a 
random homomorphism «g as above, ac{w) is uniformly distributed over G. 

We note that there is a stronger condition of measure preservation on a word w 
that is discussed in the literature. In this stronger condition we consider the image 
of w over the broader class of compact groups G w.r.t. their Haar measure. Our 
results make use only of the weaker condition that involves only finite groups. 

Measure preservation can also be defined for f.g. subgroups. 

Definition 1.3. For H <f g we say that H is measure preserving iff for any finite 
group G and «g G Hom(Fk,G) a randomly chosen homomorphism as before, cxc\h 
is uniformly distributed in Hom(H,G). 

In particular, 1 ^ w G F^ is measure preserving iff (w) is measure preserving. 

It is easily seen that primitivity or free factorness yield measure preservation. 
The reason is that as mentioned, a homomorphism in Hom(Fk, G) is completely 
determined by the images of the elements of a basis of F&, which can be chosen 
completely arbitrarily and independently. 

Several authors have conjectured that the converse is also true: 

Conjecture 1.4. For every w G F k , 

w is primitive -<=>- w is measure preserving 
More generally, for H <f g F k , 

H < Ffc •<=>- H is measure preserving 

From private conversations we know that this has occurred to the following math- 
ematicians and discussed among themselves: T. Gelander, A. Shalev, M. Larsen and 
A. Lubotzky. The question was mentioned several times in the Einstein Institute 
Algebra Seminar. This conjecture was independently raised in 

Here we prove a partial result: 

Theorem 1.5. Let H </ 9 F& have rank > k — 1. Then, 

H < Ffc H is measure preserving 
In particular, for every w 6 F 2 : 

w is primitive <^=^ w is measure preserving 

2 It is interesting to note that there is an easy abelian parallel to Conjecture II. 41 A word w GFk 
is primitive, i.e. belongs to a basis, in Z fc = Fk/F' k iff for any group G the associated word map is 
surjective. See |Seg09j , Lemma 3.1.1. 
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The proof of this result relies, inter alia, on Theorem 11.11 Note that a set of 
k — 1 elements W\, . . . , Wk-i € can be extended to a basis iff it is a free set that 
generates a free factor. Thus, the result for subgroups can also be stated for finite 
subsets as follows: Let r > k — 1. A set {w\, . . . ,w r } C F& can be extended to a 
basis iff for every finite group G and random homomorphism «g as above, the r-tuple 
(cec(wi), . . . ,ac{w r )) is uniformly distributed in G r , the direct product of r copies 



There is an interesting connection between this circle of ideas and the study of 
profinite groups. For example, an immediate corollary of Theorem 11.51 is that 

Corollary 1.6. The set of primitive elements in F 2 is closed in the profinite topology. 

We discuss this corollary and other related results in Section [7J 

In order to prove Conjecture 11.41 one needs to find for every non-primitive word 
i« G Ft, some witness finite group G with respect to which w is not measure pre- 
serving. Our witnesses are always the symmetric groups S n . 

It is conceivable that our method of proof for Theorem 11.51 is powerful enough 
to establish Conjecture 11.41 We define two categorizations of elements (and of f.g. 
subgroups) of free groups 7r(-) and 4>{-)- They map every free word and free subgroup 
into {0, 1, 2, 3, . . .} U {oo}. We believe these two maps are in fact identical. This, if 
true, yields the general conjecture. Presently we can show that they are equivalent 
under certain conditions, and this yields our partial result. 

The first categorization is called the primitivity rank. It is a simple fact that if 
w G Ffc is primitive, then it is also primitive in every subgroup of F^ containing it 
(see Claim 123]) . However, if w is not primitive in F&, it may be either primitive 
or non-primitive in subgroups containing it. But what is the smallest rank of a 
subgroup in which we can realize w is not primitive? Informally, how far does one 
have to search in order to establish that w is not primitive in F&? Concretely: 

Definition 1.7. The primitivity rank of w e F k , denoted n(w), is 



If no such J exists, tt(w) = oo. A subgroup J for which the minimum is obtained is 
called w-critical. 

This extends naturally to subgroups. Namely, 

Definition 1.8. For H <f g F&, the primitivity rank of H is 



of G. 




w e J < F fc s.t. 

w is not primitive in J. 
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Again, if no such J exists, tt(H) = oo. A subgroup J for which the minimum is 
obtained is called //-critical. 



For instance, tt(w) = 1 if and only if w is a proper power of another word (i.e. 
w = v d for some t> G F fc and d > 2). In Section H] we show (Corollary 14. 2p that in 
Ffe the primitivity rank takes values only in {0, 1, 2, . . . , k} U {oo} (the only word w 
with tt(w) — is w — 1). Lemma [4.11 shows, moreover, that 7t(w) = oo (tt{H) = oo, 



resp.) iff w is primitive (H < F&). Finally Lemma [6781 yields that 7r can take on every 
value in {0, . . . , k}. For example, if F& is given with some basis X = {x±, . . . , x k } 
then for every 1 < d < k, ^{x^ . . -x]) = d. It is interesting to mention that ir(H) 
also generalizes the notion of compressed subgroups, as appears, e.g., in |MVW07] : 
a subgroup H <f g F& is compressed iff tt(H) > rk(H). 

The second categorization of sets of formal words has its roots in |Nic94] and 
more explicitly in |LP10j . It concerns homomorphisms from F^ to the symmetric 
groups S n , and more concretely the probability that 1 is a fixed point of the per- 
mutation w(o~i, . . . , <7fc) for some w £ F k when o~xi - ■ ■ i°~k S n are chosen randomly 
with uniform distribution. More generally, for a subgroup H <f g F^ we study the 
probability that 1 is a common fixed point of (the permutations corresponding to) 
all elements in H . We ask how much this probability deviates from the correspond- 
ing probability in the case of measure preserving subgroups, i.e. from rfc(H) . (We 
continue the presentation for subgroups only. This clearly generalizes the case of a 
word: for every word w ^ 1 consider the subgroup (w).) 

Formally, for H <f g F^ we define the following function whose domain is all 
integers n > 1 where a n G Hom(Fk, S n ) is a random homomorphism with uniform 
distribution: 



Clearly, if H is measure preserving, then vanishes for every n > 1. 

Nica |Nic94| showed that for a fixed word w ^ 1 and large enough n, it is 
possible to express $ w (n) (=$( w )(n)) as a rational function in n. We show below 
that this is easily extended to apply to $h(^) for arbitrary H </ 9 F&. Nica's clever 
observation was used in |LP10] to introduce a new categorization of free words, 
denoted 0(-), which, like 7r(-), associates a non-negative integer or oo to every formal 
word (note that in |LPf 0] the notion of primitive words has a different meaning than 
in the current paper). This categorization can also be extended to arbitrary finitely 
generated subgroups of F&. More specifically, it is shown in Section [5] that for every 
H <f g Ffc and n large enough (say, at least the number of vertices in the core graph 
of H), we have 



$ H ( n ) = P ro b[Vw G H a n (w)(l) = l] 



1 



(1.1) 



n 



rk(H) 




(1.2) 



i=0 
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where the coefficients 04(H) are integers depending only on H. We define <p(H) as 



Thus, <f>(H) measures to what extent the probability that f is a common fixed 
point of H differs from nr ^ (H) , the corresponding probability if H were measure pre- 
serving. The higher <fi(H) is, the closer the probability is asymptotically to nr l (H) ■ If 
H is a measure preserving subgroup, then <p(H) = 00. 

As it turns out there is a strong connection between tt(H) and <f)(H). Already 
Nica's result can be interpreted in the language of 0(-) to say that <p(w) — 1 iff w is 
a power, that is iff %(w) = 1. But the connection goes deeper. In proving this, we 
calculate these functions using the core graph of H and its quotients. It turns out 
that both tt(H) and 4>(H) can be computed explicitly via the subgraph of the DAG 
induced by all descendants of Tx(H). 

In the calculation of <fi(H) we use the core graph Tx(H) and its quotients to parti- 
tion the event that I is a common fixed point of a n (w) of each w G H (see Section [5]). 

Fortunately, the same core graph and quotients can also be used to find the 
primitivity rank vr(iJ), as shown in Section HI Lemma [4.31 shows that all .^-critical 
subgroups (see Definition II. 8p are always represented in the fringe (set of quotients) 
of H. Theorem 1 1 . 1 1 then shows directly how to calculate n(H) using the fringe. 

We show that under certain conditions, the two categorizations tt(-) and </>(•) 
indeed coincide. 

Proposition 1.9. Let H <f g Fk- Then for every i < rk(H) + 1, 



2. Moreover, if 7r(H) = <p(H) = i then Oi(H) equals the number of if-critical 
subgroups of Ffc. 

The second part of this proposition is in fact a generalization of a result of Nica. 
For a single element idGFj which is a proper power, namely ir(w) = <p(w) = 1, let 
w = u d with d maximal (so u is not a proper power). Let M denote the number 
of divisors of d. It is not hard to see that the number of w-critical subgroups of F^ 
equals M — I: these subgroups are exactly (u m ) for every 1 < m < d such that m\d. 
This shows that the average number of fixed points in the permutation a n (w) goes 
to M as n — > 00. This corresponds to Corollary 1.3 in |Nic94| (for the case L = 1)0. 

Nica's result was more general in a different manner: it involved the distribution of the number 
of i-cycles in the random permutation a n (w), for any fixed L. He showed that asn^ 00, the limit 
distribution depends only on d 1 where w = u d as above. 



follows: 




the smallest integer iwith Oi(H) 7^ if $#(^) ^ 
00 if & H (n) = 



(1.3) 
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The connection between tt(-) and </>(■) goes beyond the cases stated in Proposition 

11.91 To start off, if ir(H) = oo, then H < and therefore H is measure preserving, 
and thus (f>(H) = oo. In addition, Lemma 16.81 states that both tt(-) and </>(•) are 
additive with respect to concatenation of words on disjoint letter sets. Namely, if the 
words u>i,u>2 £ Ffc have no letters in common then it{w\W2) = n{wi) + ^{^2) and 
(j)(wiW2) = 4>(wi) + <f){w2). Moreover, if the disjoint w\ and u>2 satisfy both parts of 
Proposition 11.91 then so does their concatenation W\W2- 

In view of this discussion, the following conjecture suggests itself quite naturally: 

Conjecture 1.10. 

1. For every H <f g 

n(H) = <j>(H) 

2. Moreover, a^miH) equals the number of if-critical subgroups of F^. 

Specifically, for a single word w, Proposition 11.91 states that for % = 0,1,2, tt(w) = 
i -v^ <p{w) = i. As mentioned, the possible values of 7t(H) are {0, 1, 2, . . . , k} U {00}, 

and 7r(H) — 00 iff H < F^. We also have 7r(H) = 00 =>■ <fi(H) = 00 (a free 
factor subgroup is measure preserving). Thus, when rk(H) > k — 1, the value of 
7r(iy) uniquely determines (j)(H) and the two values coincide. In other words, when 
rk{H) > k - 1 

vr(iJ) = 

This shows, in turn, that when H is measure preserving, we have tt(H) = <p(H) = 

00, and so if is a free factor. This yields Theorem 11.51 The same argument shows 
that Conjecture II .41 follows from part (1) of Conjecture 11.101 and suggests, in partic- 
ular, a general strategy towards proving Conjecture 11.41 

As an aside, the second parts of Proposition 11.91 and Conjecture 11.101 say some- 
thing interesting on the average number of fixed points in the random permutation 
a n (w). We conjecture that for every w and for large enough n, this average is at least 

1. In other words, among the family of distributions of S n induced by free words, a 
random uniformly chosen permutation has the least average number of fixed points. 
This point is further elaborated in Section [HI 

At this point we should clarify the relation of these results and some of what we 
did in |LP10j . There we introduced /?(•)- yet another categorization of formal words. 
Just like </>(•) and 7r(-) it maps every formal word to a non-negative integer or 00. As 
it turns out, 7r(-) and /?(•) coincide. This follows from Theorem 1 1 . 1 1 and from Section 
HI The definition of ir(-) is simpler and more elegant than the original definition of 
/?(•). As shown in jLPlOj for i = 0,1, 4>(w) = i <^=^ fi(w) = i. A partial proof was 
given there as well for the case i — 2. In Section [6] we complete the argument for 
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i = 2 and generalize it to prove Proposition 11.91 



The paper is arranged as follows. In section [2] we introduce the notions of core 
graphs, their morphisms and their quotients. In Section [3] we introduce our new 
perspective on core graphs, including the notion of immediate quotients and the 
mentioned DAG, and then prove Theorem ll.il In SectionH]we analyze the primitivity 
rank of any H <f g and show how it can be computed from the quotients of V x{H) 
in the DAG of finite rank subgroups of F&. Section [5] is devoted to proving that (j){H) 
is well defined and can be indeed computed from the same descendants of Tx{H). In 
Section [6] we establish the results connecting </>(•) and tt(-), culminating in the proof 
of Theorem 11.51 The concluding sections are devoted to two different consequences 
of the main results: the characterization of elements of which are primitive in 
its profinite completion (Section [7]) and the possible values of the average number 
of fixed points in the image of a word map on S n (Section [8]). The discussion in 
the three appendices is not necessary for the main results of this paper, but it does, 
in our view, complete the picture. In particular, we illustrate in Appendix [A] the 
algorithm to detect free factor subgroups. 

2 Core Graphs and their Quotients 

All groups that appear here are subgroups of F^, the free group with a given basis 
X = {xi, . . . ,Xk}- Some of the relations we consider depend on the choice of the 
basis. We first describe core-graphs, which play a crucial role in this paper. 

2.1 Core Graphs 

Associated with every subgroup H < F& is a directed, pointed, edge-labeled graph. 
This graph is called the core-graph associated with H and is denoted by Tx{H). We 
recall the notion of Tx{H) the Schreier (right) coset graph of H with respect to the 
basis X. This is a directed, pointed and edge-labeled graph. Its vertex set is the set 
of all right cosets of H in F^, where the basepoint corresponds to the trivial coset H. 
For every coset Hw and every letter x« there is a directed z-edge (short for Xi-edge) 
going from the vertex Hw to the vertex Hwxi. 

The core graph Tx{H) is obtained from Tx{H) by omitting all the vertices and 
edges of Fx (H) which are never traced by a reduced (i.e., non-backtracking) path that 
starts and ends at the basepoint. Stated informally, we omit all (infinite) "hanging 
trees" from Tx{H). To illustrate, Figure ET] shows the graphs Tx{H) and Tx{H) 
for H = (xiX2X± 3 , XiX 2 x^ 2 ) < F 2 . 

Note that the graph Tx{H) is 2fc-regular: Every vertex has exactly one outgoing 
j-edge and one incoming j-edge for every 1 < j < k. Every vertex of Tx{H) has at 
most one outgoing j-edge, and at most one incoming j-edge for every 1 < j < k. 
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Figure 2.1: Tx(H) and Tx(H) for H = (xiX2X± 3 ,XiX 2 Xi 2 ) < F 2 . The Schreier 
coset graph Tx (H) is the infinite graph on the left (the dotted lines represent infinite 
4-regular trees). The basepoint "(g)" corresponds to the trivial coset H, the vertex 
below it corresponds to the coset Hx\, the one further down corresponds to Hx 2 = 
HX1X2X1 1 , etc. The core graph Tx (H) is the finite graph on the right, which is 
obtained from Tx (H) by omitting all vertices and edges that are not traced by 
reduced closed paths around the basepoint. 

It is an easy observation that 

canonically 

ir 1 (T x (H))=TT 1 (T x (H)) = H 

where the canonical isomorphism is given by associating words in to paths in the 
coset graph and in the core graph: We traverse the path by following the labels of 
outgoing edges. For instance, the path (from left to right) 



corresponds to the word " 1 . (See also |MVW07j . where this fact ap- 

pears in a slightly different language). 

Core graphs were introduced by Stallings [Sta83j. Our definition is slightly dif- 
ferent, in that we allow the basepoint to have degree one. 

In fact, a "tail" in T X (H), i.e., a path to the basepoint can be eliminated by 
replacing H by an appropriate conjugate. However, we find it unnecessary and less 
elegant for our needs. 

We now list some properties of core graph, most of which are proved in at least 
one of |Sta83[ IKM02[ IMVW07] . The remaining ones are easy observations. 
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Claim 2.1. Let H be a subgroup ofFk with an associated core graph V = Tx{H). 
The Euler Characteristic of a graph, denoted %(■) is the number of vertices minus 
the number of edges. Finally, rk(H) denotes the rank of the group H . 

1. rk(H) < oo <^ T is finite 

2. rk(H) = 1 - x (r) 

3. Let A be a finite, pointed, directed graph with edges labeled by {x±, ...,£&}. 
Then A is a core graph (corresponding to some J < F&) if and only if A satisfies 
the following three properties: 

(a) A is connected 

(b) With the possible exception of the basepoint, every vertex has degree at 
least 2. 

(c) For every 1 < j < k, no two j-edges share the same origin nor the same 
terminus. 

4. There is a one-to-one correspondence between subgroups of and core graphs. 

5. There is a one-to-one correspondence between subgroups of F^ of finite rank 
and finite core graphs. 

In Appendix O we present a well known algorithm, based on Stallings' foldings, to 
obtain the core graph of every H <f g Fx given some finite generating set for H. 



2.2 Morphisms of Core Graphs 

In our framework, a morphism between two core-graphs Ti and T 2 is a map that sends 
vertices to vertices and edges to edges, and preserves the structure of the graphs. 
Namely, it preserves the incidence relations, sends the basepoint to the basepoint, 
and preserves the directions and labels of the edges. 

As in Claim 12.11 the proofs of the following properties are either easy variations 
on proofs in [Sta83, K M02[ IMVW07] or just easy observations: 



Claim 2.2. Let H\,Hi < F& be subgroups, and Ti,r2 be the corresponding core 
graphs. Then 

1. A morphism rj : Ti — > T 2 exists ^ Hi < H 2 , 
and in this case, rj* : vr 1 (r 1 ) — > ^(I^) is injective. 

2. If a morphism exists, it is unique. 

3. Every morphism in an immersion (locally injective at the vertices). 
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2.3 Quotients of Core Graphs 

With core-graph morphisms at hand, we can define the following rather natural 
relation between core-graphs. 

Definition 2.3. Let Ti,]^ be core graphs and H\,H2 < the corresponding 
subgroups. We say that T\ covers T2 or that T2 is a quotient of T\ if there is a 

surjective morphism rj : T\ -» 1^. We also say in this case that Hi covers H2, and 

x 

denote Ti -» T 2 or Hi -» H 2 . 

By "surjective" we mean surjective on both the vertices and the edges. Note 
that we use the term "covers" even though this is not a covering map in general (the 
morphism from Ti to 1^ is always locally injective at the vertices, but not necessarily 
locally bijective). 

For instance, H = (rcia^xj -3 , x^X2X\ 2 ) < F& covers the group 

J = (^2, X] 2 , X1X2X1), the corresponding core graphs of which are the leftmost and 

rightmost graphs in Figure |2"T2"1 As another example, every core graph T that contains 

edges of all labels covers the wedge graph 

x 

We already know (Claim 12. 2p that if Hi -» H2 then, in particular, Hi < H2. 
However, the converse is incorrect. For example, the group 

K = (xia^a^ 3 , xfa^i -2 ' -^s) contains H = ( ' ) (we simply added x 2 

as a third generator), yet K is not a quotient of H: the morphism rj : Tx{H) — )■ 

Tx{K) does not contain the 2- loop at the basepoint of Tx(K) in its image. 

x 

Note also that the relation Hi -» H 2 depends on the given generating set X of 

Ffc. For example, if H = (X1X2) then H -» (xi,X2) = F%. However, X1X2 is primitive 
and could be taken as part of the original basis of F2. In that case, the core graph 
of H would consist of a single vertex and single loop and would have no quotients 
except for itself. 

It is also interesting to note that every quotient of the core-graph V corresponds 
to some partition of V(T) (the partition determined by the fibers of the morphism). 
We can simply draw a new graph with a vertex for each block in the partition, and 
a j-edge from block bi to block 62 whenever there is some j-edge (1)1,1)2) in I\ with 
Hi G 61,^2 £ b 2 . However, not every partition of V(r) corresponds to a quotient core- 
graph: In the resulting graph two distinct j-edges may have the same origin or the 
same terminus. Note that even if a partition P of V(r) yields a quotient which is not 
a core-graph, this can be remedied. We can activate the folding process exemplified 
in Appendix [C] and obtain a core graph. The resulting partition P' of V(T) is the 
finest partition which yields a quotient core-graph and which is still coarser than P. 
We illustrate this in Figure 12.21 

Lemma 2.4. Every finite core-graph has a finite number of quotients. 
Equivalently, every H <f g F^ covers a finite number of other subgroups. 

Proof. The number of quotients of T is bounded from above by the number of par- 
titions of V(T). □ 
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Figure 2.2: The left graph is the core graph Tx{H) of H = (xia^x^ 3 , xfa^x^ 2 ) < 
F2. Its vertices are denoted vi,...,V4. The graph in the middle is the quotient 
corresponding to the partition P = {{i>i, U4}, {^2}, {^3}}- This is not a core graph 
as there are two different 1-edges originating at {t>i,t>4}. In order to obtain a core 
quotient-graph, we use the folding process illustrated in Appendix O The resulting 
core graph is on the right, corresponding to the partition P' = {{vx,v^}, {^2,^3}}- 

Following the notations in |MVW07] . we call the set of X-quotients of H the 
X -fringe of H and denote Ox{H). Namely, 

O x {H) := {T X (J) \ H^J} (2.1) 

Lemma [2.41 states in this terminology that for every H <f g (and every basis X), 
\O x (H)\< 00. 

Before describing our new perspective on core graphs, we remind some useful 
facts about free factors in free groups: 

Claim 2.5. Let H,J,K < F k . Then, 

1. Free factorness is transitive: If H < J < K then H < K. 

2. If 77 : Tx{H) ■=-)■ Tx{J) is an embedding then H < J. 

3. If H < J then H is a free factor in any subgroup H < M < J in between. 

Proof. The first and second claims are immediate. We give a "graph-theoretic" proof 

for the third one. Assume that H < J, and let Y be a basis of J extending some 
basis of H . In particular, Ty{H) and Ty(J) are both bouquets, consisting of a single 
vertex and rk(H) (resp. rk(J)) loops. Now, for every H < M < J, consider the 
morphism rj : Ty{H) — > Ty(M). It is easy to see that a core-graph-morphism of a 

bouquet must be an embedding. Thus, by the second claim, H < M. □ 
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3 Immediate Quotients and the DAG of Core Graphs 



The quotient relation yields a partial order on the set of core graphs. But we are 
interested in a special case which we call immediate quotients. This relation allows 
us to build the aforementioned DAG (directed acyclic graph) of all (core graphs 
corresponding to) finite rank subgroups of 

Let T be a core graph, and let P be a partition of V(r). Let A be the quotient 
core graph we obtain from P by the folding process described in Figures IC.ll and 
12.21 We say that A is generated from F by P. We are interested in the case where 
P identifies only a single pair of vertices: 

Definition 3.1. Let T be a core graph and let P be a partition of V(r) in which 
all parts consist of a single vertex with a single exceptional part that contains two 
vertices. Let A be the core graph generated by P. We then say that A is an 
immediate quotient of T. 

Alternatively we say that A is generated by merging a single pair of vertices of T. 
For instance, the rightmost core graph in Figure I2T21 is an immediate quotient of the 
leftmost core graph. 

The relation of immediate quotients has an interesting interpretation for the 
associated free groups. Let H, J < F& be free groups and T = Tx{H),A = Tx{J) 
their core graph, and assume A is an immediate quotient of T obtained by identifying 
the vertices u, v G ^(r). Now let p u ,p v G F*. be words corresponding to some paths 
in T from the basepoint to u and v respectively. It is not hard to see that identifying 
u and v has the same effect as adding the word w = PuPv 1 to H and considering the 
generated group. Namely, J = (H,w). 




Based on the relation of immediate quotients we consider the DAG XV The set of 
vertices of this graph consists of all finite core graphs with edges labeled by 1, . . . , k, 
and its directed edges connect every core graph to its immediate quotients. Every 
fixed ordered basis of Fk X = {xi, . . . ,Xk}, determines a one-to-one correspondence 
between the vertices of this graph and all finite rank subgroups of F&. 

As before, we fix an ordered basis X. For any H <f g Fk, the subgraph of of 
the descendants of Tx{H) consists of all quotients of Fx(H), that is of all elements of 
the X-fringe Ox{H)- By Lemma [2.4[ this subgraph is finite. In Figure [3TT1 we draw 
the subgraph of T>k consisting of all quotients of Tx{H) when H = (x\X2%\ X %2 
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The edges of this subgraph (i.e. immediate quotients) are denoted by the broken 
arrows in the figure. 

® — >■ • 

2 2 



V I I 

• >■ • 




Figure 3.1: The subgraph of T>^ induced by Ox{H), that is, all quotients of the core 
graph T = Tx (H), for H = (xix^^x^ 1 ) . The dashed arrows denote immediate 
quotients, i.e. quotients generated by merging a single pair of vertices. T has exactly 
seven quotients: itself, four immediate quotients, and two quotients at distance 2. 

This yields the aforementioned distance function between a finite core graph and 
a quotient of it: 

x 

Definition 3.2. Let Hx,H 2 </ s be finite rank subgroups such that H\ -» H 2 , 
and let T 1 = T x (Hi), T 2 = T X (H 2 ) be the corresponding core graphs. We define the 
distance between Hi and H 2 , denoted px{Hi, H 2 ) or p(r 1 ,r 2 ), to be the shortest 
length of a directed path from Ti to T 2 in T>^. 

In other words, px{Hi, H 2 ) is the length of the shortest series of immediate quo- 
tients that yields T 2 from I\. Equivalently, it is the minimal number of pairs of ver- 
tices that need to be identified in I\ in order to obtain T 2 (via the folding process). 
For example, if T 2 is an immediate quotient of Ti then px{Hi, H 2 ) = p(Ti, T 2 ) = 1. 
For H = (xix 2 x : [ 1 x 2 1 ), Tx(H) has four quotients at distance 1 and two at distance 
2 (see Figure l3~TT) . 

As aforementioned, by merging a single pair of vertices of Tx{H) (and then 
folding) we obtain the core graph of a subgroup J obtained from H by adding 
some single generator. Thus, by taking an immediate quotient, the rank of the 
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associated subgroup increases at most by 1 (in fact, it may also stay unchanged or 

x 

even decrease). This implies that whenever H -» J: 

rk(J)-rk(H) < p(H,J) (3.1) 

It is not hard to bound the distance from above as well: 

x 

Lemma 3.3. Let H, J <j g such that H -» J. Then 

rk(J)-rk(H) < p x (H,J) < rk(J) 

We postpone the proof of the upper bound to Appendix [Bl (In fact, this upper 
bound in not needed for the main results of this paper. We give it anyway in order 
to have the full picture in mind.) Theorem 11.11 then states that in the same setting, 
the lower bound is attained iff H is a free factor of J. In fact one can visualize 
these results in the following way. Consider a two dimensional table which contains 
all the elements of the fringe Ox{H) (each quotient of Tx{H) contained in some, 
not necessarily distinct, cell). The rows correspond to the rank and are indexed 
0, 1,2,3, .... The columns correspond to the distance from H and are also indexed 
0, 1, 2,3,.. .. We then put every quotient of H in the suitable cell in the table. Let 
r = rk(H) denote the rank of H. Lemma 13.31 then says that the (finitely many) 
elements of Ox{H) are spread across r + 1 diagonals in the table: the main one and 
the r diagonals below it. Theorem 11.11 implies that within Ox{H), H is a free factor 
of exactly those J-s found in the lowest of these r + 1 diagonals. (In fact, Lemma H~3l 
shows that tc(H) can also be read from this table: it equals the rank of the upmost 
occupied cell in this table outside the free-factor-diagonal.) 

3.1 Proof of Theorem [1TT1 

x 

The main result of this section states that if H <f g J <j g and H — » J, then 

p x (H,J)=rk(J)-rk(H) H < J. (3.2) 

In fact, one of the implications is trivial. As mentioned above, merging two 
vertices in Tx{H) is equivalent to adding some generator to H. If we manage to 
obtain Tx(J) from Tx{H) by rk(J) — rk(ff) merging steps, this means we can obtain 

J from H by adding rk(J) —rk(H) extra generators to H, hence clearly H < J (recall 
that by hopfianity of the free group, every generating set of size rk (J) is a basis of 
J, see e.g. |Bog08j Chapter 2.29]). Thus, 



Px(H,J) =rk{J) -rk{H) H < J (3.3) 

The other implication is not trivial. Assume that H < J. Our goal is to obtain 
rk(J) — rk(H) complementary generators of J from H, so that each of them can be 
realized by merging a pair of vertices in Fx(H). 
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To this goal we introduce the notion of a 11 handle number" associated with a 
subgroup M and a word w G (It also depends on the fixed basis X of F&). This 
number is defined as follows. Let V = Fx(M). Denote by p w the longest prefix of 
w that corresponds to some path from the basepoint of T (we trace the letters of 
w along T until we get stuck). Likewise, denote by s w the longest suffix of w that 
ends at the basepoint (here we trace w^ 1 from the basepoint until we get stuck). If 
\pw\ + \s w \ < \w\, then w = p w m w s w as a reduced expression for some 1 ^ m w G 
The handle number of (M, w) is then 



h x (M,w) = h(T,w) = 




\Pw\ + \S W \ < \W 

otherwise 



Claim 3.4. Assume that w M and let N = (M,w). Then 

1. hx (M,w) > if and only ifTx (M) is a (proper) subgraph of Fx (N), and 

2. hx (M,w) = if and only ifTx {N) is an immediate quotient ofTx {M). 

Proof. Assume first that hx (M, w) > 0. In the notations of the previous paragraph, 
let v(p w ),v(s w ) be the end point of the path corresponding to p w and the starting 
point of the path corresponding to s w . We can then add a "handle" to T in the 
form of a path corresponding to m w which starts at v(p w ) and ends at v(s w ). (If 
v (Pw) — v(s w ) this handle looks like a balloon, possibly with a string.) 




The resulting graph is a core-graph (the edge conditions at v(p w ) and v(s w ) hold, 
by the maximality of p w and s w ), and it corresponds to N. So we actually obtained 
Tx (N). It follows that Fx (M) is a proper subgraph of Fx {N). On the other hand, 
if hx (M,w) = 0, i.e. if \p w \ + \s w \ > \w\, we can find a pair of vertices in T whose 
merging adds w to H as a complementary generator for J. (We may take v(p w ) 
together with the vertex on the path of s w at distance \p w \ + \s w \ — \w\ from v(s w ).) 

□ 

The last claim shows, in particular, that if iV is obtained from M by adding a 
single complementary generator, then either Tx (N) is an immediate quotient or it 
contains Tx (M) as a proper subgraph. This already proves Theorem II. II for the case 
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rk (J) — rk (H) = 1: if H -» J, we are clearly in the second case of Claim EJT^j i.e. J 
is an immediate quotient of H. 

We proceed by defining handle numbers for a subgroup M <f g and an ordered 
set of words w%, . . . ,w t £ Let N = (M, Wi, . . . , w t ) and A = Yx (N). Let in 
addition Ni = (M, u>i, . . . , it?*) and Ti = TxiN). We obtain a series of subgroups 

M = N <N 1 <...<N t = N, 

and a series of graphs T = Tq, F±, . . . , T t = A. We denote by hx (M, Wi, . . . , Wt) the 
t-tuple of the following handle numbers: 

h x (M, w 1 ,..., w t ) = (h (r , twi) , h (Tt, w 2 ) , ■ . . , h x (r t _i, w t )) . 

Let us focus now on the case where t is the cardinality of the smallest set SCF^ 
such that iV = (M,S). The following lemma characterizes t-tuples of words for 
which the t-tuple of handle-numbers is lexicographically minimal. It is the crux of 
the proof of Theorem 11.11 

Lemma 3.5. In the above notations, let (wi, . . . ,w t ) be an ordered set of comple- 
mentary generators such that the tuple of handle numbers hx (M, w%, . . . , Wt) is lex- 
icographically minimal. Then the zeros in hx (M,Wi, . . . ,w t ) form a prefix of the 
tuple. 

Namely, there is no zero handle-number that follows a positive handle-number. 

Proof. It is enough to prove the claim for pairs of words (i.e. for t — 2), the general 
case following immediately. Assume then that N = (M, w±, w 2 ), that 2 is the min- 
imal number of complementary generators of iV given M, and that hx {M,wi,w 2 ) 
is lexicographically minimal. In the above notation, assume to the contrary that 
h(T ,Wi) > and h(T 1 ,W2) = 0. Let mi = m m denote the handle of W\ in T . 
Thus Fx was obtained from r by adding a handle (or a balloon) representing m\. 
The word w 2 can be expressed as w 2 = ps so that there is a path corresponding to 
p in Ti, emanating from the basepoint and ending at some vertex u, and there is 
a path s to the basepoint from a vertex v. (Clearly, u ^ v for otherwise w 2 £ N\ 
contradicting the minimality of t = 2.) Thus T 2 is attained from Ti by identifying 
the vertices u and v. By possibly multiplying w 2 from the left by a suitable element 
of Ni, we can assume that p does not traverse the handle mi "more than necessary". 
Namely, if u does not lie on m±, then p avoids m±, and if it does lie on m±, then only 
the final segment of p traverses mi till u. The same holds for s and v (with right 
multiplication). 

The argument splits into three possible cases. 

• If both u, v belong already to r , then h(T ,w 2 ) = 0. In this case we can 
switch between w 2 and w% to lexicographically reduce the sequence of handle 
numbers, contrary to our assumption. 
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• Consider next the case where, say, v G V(T ) but u G V(ri) \ V(T ), i.e, u 
resides on the handle m\. Then, the handle needed in order to add W2 to Tq 
is strictly shorter than h(To,Wi) = \m\\. Again, by switching W2 with w\ the 
sequence of handle numbers goes down lexicographically - a contradiction. 




• In the final case that should be considered both u and v are on the handle mi. 
I.e. u,v G V(ri) \ V(Tq). Assume w.l.o.g. that when tracing the path of mi, 
u precedes v. As before we can premultiply and postmultiply u> 2 by suitable 
elements of Ni to guarantee the following: The path p, from the basepoint of 
Ti to u, goes through T and then traverses a prefix of mi until reaching u, 
and the path s from v to the basepoint traces a suffix of mi and then goes 
only through Tq. Again h(To,W2) < h(To,Wi), so that switching W2 with W\ 
lexicographically reduces the sequence of handle numbers. (A similar argument 
works in the case mi constitutes a balloon.) 




□ 

* 

Theorem 11.11 follows easily from this lemma. Indeed, assume that H < J and that 

H -» J. Let t = rk (J) — rk (H) denote the difference in ranks, so that t is the smallest 
number of complementary generators needed to obtain J given H. Let (wi, . . . , w t ) 
be an ordered set of complementary generators so that hx {H, Wi, . . . , w t ) is lexico- 
graphically minimal. Similarly to the notations above, let Jj = (H,Wi, . . . ,Wi) and 

r, r.v (-a 
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By the lemma, there is some < q < t so that h (To, Wi) = . . . = h (r g _i, w q ) = 
whereas h(T q , w q +i), . . . , h (IY_i, Wt) are all positive. By Claim [33] it follows that 
Ti is an immediate quotient of for 1 < i < q, and therefore J q G Ox {H) 
and px {H, J q ) = q. (This in fact shows that px(H, J q ) < rk(J q ) — rk(H), and the 
equality follows from Lemma 13. 3j) . 

Using Claim lBT^l again. we see that Tj is a proper subgraph of T i+1 for q < i < t— 1. 
So that T q is a subgraph of T t = Tx {J)- But then the image of the graph morphism 
7] : Tx (H) — > Tx {J) is clearly the subgraph T q . If q < t this is a proper subgraph, 

which contradicts the assumption H -» J . Hence q = t and px {H, J) = t, as 
required. Together with (13. 3p this completes the proof of Theorem 11.11 □ 
In fact, the same argument yields a more general result: 

Corollary 3.6. Let H < J < be f.g. groups, and let t be the minimal number 
of complementary generators needed to obtain J from H . Then t is computable as 
follows. Let i] : Tx {H) — > Tx {J) be the unique core-graph morphism, and let M be 
the intermediate subgroup corresponding to the image 77 (1^ {H)). Then, 

t = p x (H, M) + rk (J) - rk (M) . 

Proof. In the notation of the last part of the proof of Theorem II. 1[ we see that 
M = J q < J, and so p x (H, M) + rk (J) - rk (M) = p x (H, J q ) + (t - q) = t. □ 

Remark 3.7. Note that in the crucial arguments of the proof of Theorem 11.11 we 
did not use the fact that the groups where of finite rank. Indeed, this result can be 
carefully generalized to subgroups of of infinite rank. 

* 

Remark 3.8. Another way to interpret Theorem 11.11 is by saying that if H < J 

and H -» J with t = rk (J) — rk(H), then there exists some set {w[, . . . , w' t } of 
complementary generators such that each Wi can be realized by merging a pair of 
vertices in Tx {H). To see this, let Wi, . . . , Wt be as in the proof above, so Wi can be 
realized by merging a pair of vertices u and v in Let r]i_i : Tx {H) — > be 

the surjective morphism, and pick any vertices in the fibers u' G v^ 1 (u) , v' G i]^ 1 (v). 
Let w\ be some word corresponding to the merging of u' and v' in Tx {H). It is not 
hard to see that for each i, (H, Wi, . . . , Wj) = (H, w[, . . . , w'j). 

4 More on the Primitivity Rank 

Recall Definitions 11.71 and 11.81 where we defined ir(w), the primitivity rank of a word 
w G Ffc, and vr(if), the primitivity rank of H <f g F^. In this subsection we prove 
some characteristics of this categorization of formal words, and show it actually 
depends only on the quotients of the core graph Tx{H) (or Tx((w))). The claims 
are stated for subgroups, and can be easily interpreted for elements with the usual 
correspondence between the element and the subgroup it generates (w). We 

begin by characterizing the possible values of tt(H). 
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Lemma 4.1. Let if <f g F k . Then 

H < F k & ir{H) = oo. 

Proof. Recall that 7r(ff) is defined by the smallest rank of subgroups of where if 
is contained but not as a free factor. If if is not a free factor of F^, then F& itself is 

one such subgroup so that tt(H) < k < oo. If if < F&, Claim [2751 shows it is a free 
factor in every other subgroup containing it. Thus, in this case tt(H) = 00. □ 

Corollary 4.2. For every if <f g F k 

ir(H) G {0, 1, ...,£;} U {00} 

In the definition of the primitivity rank of a subgroup if, we consider all subgroups 
of Fk containing H but not as a free factor. It turns out it is enough to consider 
only subgroups of F^ that are covered by if, that is, groups whose associated core 
graphs are in the X-fringe Ox{H)- 

Lemma 4.3. For every if <f g F k 

ir(H) = min lrk(J) H ~* 3 and 1 (4.1) 

^ H is not a free factor of J J 

Moreover, all H -critical subgroups ofF^ are covered by H. 

Proof. Recall that //-critical subgroups of F^ are the subgroups of smallest rank in 
which H is not a free factor (so in particular their rank is exactly ir(H)). It is enough 
to show that every iJ-critical subgroup has its associated core graph in the fringe 
O x {H). 

Consider an if -critical subgroup J < F&. This J contains H but not as a free 
factor. By Claim |2~21 there exists a morphism rj : Tx{H) — > Tx{J). If i] is surjective 

then H —» J and Tx{J) G Ox{H)- Otherwise, consider J', the group corresponding 
to the core graph f](Tx(H)). This graph, Tx{J'), is a strict subgraph of Tx{J), 

and so J' <z J (see Claim 123]) . In particular H -» J' and rk(J') < rk(J). It is 

impossible that H < J', because by transitivity this would yield that H < J. Thus, 
J' is a subgroup in which if is a not free factor, and of smaller rank than J. This 
contradicts the fact that J is if-critical. □ 

We note that in the terminology of [KM021 IMVW07] . if -critical subgroups are 
merely a special kind of "algebraic extensions" of H . (An algebraic extension of H 
is a group J such that for every M with H < M ^ J, M is not a free factor of 
J.) Specifically, if -critical subgroups are algebraic extensions of if of minimal rank, 
excluding if itself. Our proof actually shows the more general fact that all algebraic 
extensions of if can be found in the fringe (this fact appears in [KM02, 1MVWQ7] ). 
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At this stage we can describe exactly how the primitivity rank of a subgroup 
H <f g Ffc can be computed. In fact, all algebraic extensions and critical subgroups 
of H can be immediately identified: 

Corollary 4.4. Consider the induced subgraph ofD^ consisting of all core graphs in 
O x (H). Then, 

• The algebraic extensions of H are precisely the core graphs which are not an 
immediate quotient of any other core graph of smaller rank. 

• The .^-critical subgroups are the algebraic extensions of smallest rank, exclud- 
ing H itself, and tt (H) is their rank. 

Proof. The second statement follows from the discussion above and from definition 
11.81 The first statement holds trivially for H itself. If J is a proper algebraic extension 
of H, then by the proof of Lemma 14.31 J £ Ox {H). If Tx {J) is an immediate 

quotient of some Tx (M) of smaller rank, where M G Ox (H), then H < M < J 

* 

and by (the easier implication of) Theorem 1 1.1 1 we conclude M < J, a contradiction. 
On the other hand, if J G Ox {H) is not an algebraic extension of H, then there 

is some intermediate subgroup L such that H < L J. We can assume L G Ox {H) 
for otherwise it can be replaced with U corresponding to the image of the morphism 

77 : T x (H) T x (L) (whence V G O x (H) and H < V < L < J). From (the 
harder implication of) Theorem 11.11 it follows that px (L, J) = rk(J) — rk(L). The 
prior-to-last element in a shortest path in T>k from Tx (L) to Tx {J) is then a proper 
free factor of J at distance 1 that belongs to Ox (H). □ 

As an example, consider H = {x\X2X^ l %2 1 )- The full lattice of groups in Ox(H) is 
given in Figure 1370 There is one group of rank 1 (H itself), 5 of rank 2 and one 
of rank 3. The only group in the lattice where H in not a free factor is the group 
{x\X2j = F2, of rank 2, so tt(H) = 2. (And the set of algebraic extensions of H is 
precisely {H, F 2 }.) 

5 The Calculation of <fi 

The proof of Proposition 11.91 as well as the reasoning that underlies Conjecture II. 10\ 
are based on the fact that both <fi(H) and tt(H) can be calculated by analyzing 
Ox{H), the set of quotients of Tx{H). In the previous section it was shown how 
ir(H) is determined from Ox{H). In this section we show how (f>(H) can be calculated 
by a simple analysis of the very same set. The origins of the algorithm we present 
here can be traced to |Nic94| with further development in |LP10] . We present it here 
from a more general perspective. 

Let the group G act on a set Y and let yo G Y be a fixed element. Consider a 
random homomorphism ac G Hom(Fk,G). The core graphs in Ox{H) can be used 
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to calculate the probability that ao(H) C G yo (where G yo is the stabilizer of the 
element yo). The quotients of the core graph Tx(H) correspond to all the different 
"coincidence patterns" of the paths of yo through the action of the images of all w G 
H, thereby describing disjoint events whose union is the event that ao(H) C G yo . 

The idea is that in order to determine whether oq(w) stabilizes yo for some 
«) G F^, we do not need to know all the values ac(xi) over X{ G X (the given basis of 
Ffe). Rather, we only need to know how aa(xi) acts on certain elements in Y, those in 
the path of y through oq(w). Namely, if w = x^ . . . Xj™ , ji G {1, . . . , k}, e« G {±1}, 
we need to uncover the elements yi,...,y\ w \ in the following diagram: 

aa{xA a G (x e A aaix^A a G\x}™\j a G\x}™\) 

yo *- Hi *- V2 *- • • • *- y\w\-i *- y\w\ 

That is, the image of x^ acts on yo, and we denote the resulting element by y± G Y. 
The image of y\ under the action of ctG^jl) is denoted by yi, etc. Then, y is a fixed 
point of a G (w) iff y\ w \ = y . 

There are normally many possible series of elements yi, ■ ■ ■ ,y\ w \-i G Y that can 
constitute the path of yo through cxg(w) such that yo is a fixed point. We divide 
these different series to a finite number of categories based on the coincidence pattern 
of this series. Namely, two realizations of this series, yi, . . . , y\ w \~i, and y[, . . . , yLij 
are in the same category iff for every i,j G {0, . . . , \w\ — 1}, yi = yj y\ = y'j 
(note that the elements of the series are also compared to yo). In other words, every 
coincidence pattern corresponds to some partition of {0, 1, . . . , \w\ — 1}. 

However, because the elements 0.0(0; j) G G act as permutations on Y, not every 
partition yields a realizable coincidence pattern: if, for example, 2^ = X J 7 € *' '> an d 
Hi — Ut-i we must also have yi = y§. This condition should sound familiar. Indeed, 
for each coincidence pattern we can draw a pointed, directed, edge-labeled graph 
describing it. The vertices of this graph correspond to blocks in the partition of 
{0, 1, . . . , \w\ — 1}, the basepoint corresponding to the block containing 0. Then, for 
each ie{l,...,|w|} there is a jj-edge, directed according to between the block of 
i — 1 to the block of i. The constraints that coincidence patterns must satisfy then 
becomes the very same ones we had encountered in our discussion of core graphs. 
Namely, no two j-edges share the same origin or the same terminus. 

Thus, the different realizable coincidence patterns of the series 2/0,2/1, . . . ,y\ w \-i 
are exactly those described by core graphs that are quotients of Tx((w)). For in- 
stance, there are exactly seven realizable coincidence patterns that correspond to the 
event in which yo is a fixed point of ao(w) when w — [x±, x^\. The seven core graphs 
in Figure 13.11 correspond to these seven coincidence patterns. 

Finally, the same phenomenon generalizes to any H <f g Instead of uncover- 
ing the path of yo through the image of a single word, we uncover the paths trough 
all words in H. The union of these paths in which y is stabilized by all elements of 
H is depicted exactly by the core graph Tx(H). The realizable coincidence patterns 
correspond then to the quotients of Fx(H), namely to Ox(H). To summarize: 
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r , ^-^ r r describes the coincidence pattern , 

Prob\a G {H) C G yo ] = V Prob\ r , , , p (5.1) 

z — ' of Wn through the action of ac<(H) 

The advantage of the symmetric group S n with its action on {1, . . . , ft} is that 
the probabilities in the r.h.s. of (15.11) are very easy to formulate. Let a n = as n G 
Hom(Fk, S n ) be a uniformly distributed random homomorphism, and let T G Ox{H) 
be one of the quotients of Tx{H). Denote by Pr{n) the probability that a n (H) C 
(S n )i and that the coincidence pattern of the paths of 1 through the elements olg{H) 
are described by V . Then we can give an exact expression for Pr{n) in terms of Vr, er 
and ef, the number of vertices, edges and j-edges in T: 

There are (n — l)(ft — 2) . . . (n — + 1) possible assignments of different ele- 
ments from {2, 3, ... , ft} to the vertices of T (excluding the basepoint which always 
corresponds to the element 1). Then, for a given assignment, there are exactly e 3 r 
constraints on the permutation a n (xj). So the probability that the permutation 
a n (xj) agrees with the given assignment is 

(n-4)! 1 



ft! 



(for ft > ef). Thus 



Pr(n) 



ft(ft — 1) ... (ft — ef + 1) 



(ft - l)(ft - 2) . . .(ft - v r + 1) 
IlLi n ( n - 1) • • • (ft - ef + 1) 



Recall the definition of $jj(n) in ( II. ip . Since for every j and every T G Ox{H) 
we have ef < ef x (.ff) we can summarize and say that for every n > max,,- ef x (in (in 
particular for every n > Vr x <H)), we have: 



$ H ( n ) = Pro6 [Vw G # a n (w)(l) = 1] 



n rk(H) 

Prob[a n (H) C (S^ ' 



■k(H) + 



(ft - l)(ft - 2) . . .(ft - v r + 1) 



^ r6 ^{ fl) n* =1 n(n-l)...(n-4 + l) 

J_ + v 1 (iziKizili^izS^) (52) 
^ r e ^,)^ r+1 nUa-i)-a-^ 



For instance, for H = ([xi,X2]) there are seven summands in the r.h.s. of (15.21) . 
corresponding to the seven core graphs in Figure 13.11 If we go over these core graphs 
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from top to bottom and left to right, we obtain that for every n > 2: 

. . 1 (n-l)(n-2)(n-3) 

n n{n — 1) ■ n(n — 1) 

n — 1 n—1 in — l)(n — 2) 

-| 1 1 i ^ '- h 

n(n — 1) • n n ■ n{n — 1) n{n — 1) • n(n — 1) 

(n-l)(n-2) n-l 1 



n(n — 1) ■ n(n — 1) n(n — 1) • n(n — 1) n ■ n 



1 1 1 

+ 



n n — 1 n{n — 1) 

Recall the definition of the second categorization of sets of free words, <f)(H), in 
(11. 3p . Indeed, we can rewrite (15. 2p as a power series in -, and obtain that (for large 
enough n) 

ai(H) 



7T 
i=0 



where the coefficients cii(H) depend only on H . We need not consider negative values 
of % because the leading term of every summand in (15. 2p is er J-„ r+1 , and er — v r + 1 
is non- negative for connected graphs. In fact, this number also equals the rank of 
the free subgroup corresponding to T. 

The value of 4>(H) equals the smallest % for which cii(H) does not vanish. For 
instance, for H = ([xi,^]) we have 



1 00 1 

n{n — 1) n l 



so that ao(H) = a^H) = and a^H) = 1 for i > 2. Hence <f>(H) = 2. 

In fact, we can write a power series for each T G Ox{H) separately, and obtain: 

1 (l-i)(l-^)...(l-^i) 
Pr(n) - y n n n 



nL(i-i)...(i 



ef-1- 



1 T(l- m "?" ( ^ +0(^)) < 5 - 3 ) 



n e r -v r +l\ n \ n l 

This shows that if V = Fx(J) (J </ 5 F^), then Pr{n) never affects a,i(H)-s with 
i < rk(J). It is also easy to see that all the coefficients of the power series expressing 
Pr{n) are integers. We summarize: 

Claim 5.1. For every H <f g F^, all the coefficients a,i(H) are integers. 

Moreover, ai{H) is completely determined by core graphs in Ox{H) corresponding to 

groups of rank < i . 
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6 Relations between tt(-) and </>(•) 

We now have all the background needed for the proof of Proposition 11.91 and con- 
sequently of Theorem 11.51 We need to show that for every H <f g and every 
i < rk(H) + 1, we have 

tt(H) =i^^(f)(H) =i. 

The proof is divided into three steps. First we deal with the case % < rk(H), then 
with % = rk(H). The last case % = rk(H) + 1 is by far the hardest. 

Lemma 6.1. Let H <f g F^ and i < rk(H). Then 

1. n(H) = i^cf)(H) = i 

2. If 7r(H) = <p{H) = i then ai{H) equals the number of if-critical subgroups of 
F fc . 

x 

Proof. Let m denote the smallest rank of a group J < F/. such that H -» J (so 
m < rk(H)). The first part of the result is derived from the observation that both 
tc(H) = i and 4>{H) — % iff m = i. Let us note first that tt(H) = i ^ m = i. This 
follows from Lemma 14.31 and the fact that H cannot be a free factor in a subgroup 
of smaller rank. 

We next observe that 4>{H) = i m = v. If m < rk(H) then by (15. 2p and 
(15. 3p . m is indeed the smallest index for which a m (H) does not vanish (this does 
not work for m = rk(H) because of the term ( — nr l (H) ) in the definition of $#(n)). 
Conversely, if m = rk(H) then obviously <p{H) > rk(H). 

For the second part of the lemma, recall that H is not a free factor in any subgroup 
of smaller rank containing it. Thus, by (15. 3p and Lemma |4.3[ both a»(i?) and the 
number of subgroups of rank i containing H equal the number of subgroups of rank 
i in O x {H). □ 

The case i = rk(H) is slightly different, but almost as easy. 
Lemma 6.2. Let H <j g F k . Then, 

1. w(H) = rk{H) & (f)(H) = rk(H) 

2. If tt(H) = <p{H) = rk(H) then a r k(H){H) equals the number of .^-critical 
subgroups of Ffc. 

Proof. From Lemma [6.11 we infer that tt(H) > rk(H) <^> <j>{H) > rk(H). So we as- 
sume that indeed ir(H), (j>{H) > rk(H), or, equivalently, that there are no subgroups 
covered by H of rank smaller than rk(H). 

We show that both sides of part (1) are equivalent to the existence of a quotient 
(corresponding to a subgroup) of rank rk(H) in Ox{H) other than Tx{H) itself. 
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Indeed, this is true for 7r(H) because the only free product of if of rank rk(H) is if 
itself. 

As for 0(if), this is true because when 4>(H) > rk(H) it is easily verified that the 
value of a r k(H){H) equals the number of quotient in Ox{H) of rank rk(H) minus 1 
(this minus 1 comes from the term ( — rfc 1 (g) J ) . We think of this term as offsetting 
the contribution of Tx{H) to a r i-rm(H), so a r k(H){H) equals the number of other 
quotients in Ox{H) of rank rk(H). 

The second part of the lemma is true because all if -critical subgroups are covered 
by if (Lemma USD. □ 

6.1 The Case % = rk{H) + 1 

The most interesting (and the hardest) case of Theorem II .51 is when rk(H) = k — 1. 
In the previous analysis this corresponds to i = rk(H) + 1. 

Lemma 6.3. Let H <j g F k . Then, 

1. tt(H) = rk(H) + 1 <f)(H) = rk(H) + 1 

2. If 7r(if) = 4>{H) = rk(H) + 1 then a r k(H)+i(H) equals the number of if-critical 
subgroups of F fc . 

Denote by f = Tx{H) the associated core graph. By Lemmas 16.11 and 16. 2\ we can 
assume that 7r(H),<f)(H) > rk(H) + 1. In particular, we can thus assume that if is 
not contained in any subgroup of rank smaller than rk(H) + 1 other than H itself. 

The coefficient a r k(H)+\{H) in the expression of $//(n) is the sum of two expres- 
sions: 

• The contribution of f which equals — ( (^) — Ylj=i ( 6 | ) ) 

• A contribution of 1 from each core graph of rank rk(H) + 1 in Ox{H) 

Thus, our goal is to show that the contribution of f is exactly offset by the contri- 
bution of the core graphs of rank rk(H) + 1 in Ox{H) in which H is a free factor. 
This would then yield immediately both parts of Lemma 16.31 But the number of 
subgroups of rank rk(H) + 1 (in Ox{H)) in which H is a free factor equals exactly 
the number of immediate quotients of T: Theorem 11.11 shows that only immediate 
quotients of T are subgroups of rank rk(H) + 1 in which if is a free factor. On the 
other hand, (13. ip and the assumption that H in not contained in any other subgroup 
of equal or smaller rank yield that every immediate quotient of Y is of rank rk(H) + 1 
(and if is a free factor in it) . 

Thus, Lemma 16.31 follows from the following lemma. 
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Lemma 6.4. Assume n(H),(j)(H) > rk(H). Then T = Tx{H) has exactly 



immediate quotients. 



The intuition behind Lemma 16.41 is this: Every immediate quotient of T is gener- 
ated by identifying some pair of vertices of T, and there are exactly rjj such pairs. 
But for every pair of equally-labeled edges of T, the pair of origins generates the 
same immediate quotient as the pair of termini. This intuition needs, however, some 
justification that we now provide. 

To this end we use the graph T, a concept introduced in [LFIOB This graph 
represents the pairs of vertices of T and the equivalence relations between them 
induced by equally-labeled edges. There are f u r) vertices in T, one for each un- 
ordered pair of vertices of f . The number of directed edges in T is Y^=i ("jf ) > 
one for each pair of equally-labeled edges in T. The edge corresponding to the 
pair {ei,e 2 } of j-edges is a j-edge connecting the vertex {original), origin^)} to 
{ter minus (ei), terminus (€2)}- For example, when S consists of the commutator 
word, T has („) = 6 vertices and + = 2 edges. We illustrate a slightly more 
interesting case in Figure 16.11 




{«0>«l} {«0,«2} {"0,f4} {V2,V6} {v 2 , V 3 } {^0,^6} {vi,V3} 



{vi,v 2 } 



{V4,V&} {vi,v 6 } 

• • {VQ,V 3 } 



{v3,ve} 



{vi,V5} 



>{V2,V4} 



{"3^4} {^0^5} {"2,^5} {^1,U4} {Vi,V5} {v 5 ,v 6 } {v 3 ,v 5 } 



Figure 6.1: The graph T (on the right) corresponding to f = Tx{H) (on the left) 
for H = (xfx2XiX2X- l ~ 1 X2) ■ (The vertices of T are denoted here by Vq, . . . , t^.) 

We denote the set of connected components of T by Comp(T). The proof of 
Lemma 16.41 will follow from two facts that we show next. Namely, T has exactly 

(If ) — SjLi ( e | ) connec t e d components. Also, there is a one-to-one correspondence 
between Comp(T) and the set of immediate quotients of T. 

3 This is a variation of the classical construction of pull-back of graphs (in this case the pull-back 
of the graph T with itself). 
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Claim 6.5. If w(H),4>(H) > rk(H), then 



icompmi = (; f )-x:(f) 

Proof. Since T has ( u r) vertices and Y^!j=i ClO edges, it is enough to show that it is 
a forest, i.e., it contains no cycles. 

Let C G Comp(T) be some component of T. Clearly, every vertex in C (which 
corresponds to a pair of vertices in T) generates the same immediate quotient. Denote 
this quotient by A(C), and the corresponding subgroup by J. In particular, rk{J) = 
rk(H) + 1 (recall that under the claim's assumptions, H is not contained in any 
other subgroup of smaller or equal rank). Assume to the contrary that C contains a 
cycle. Edges in T are directed and labeled, so there is an element u G which 
corresponds to this cycle started, say, at the vertex {x, y}. 

{x,y} 
. -P 

\ u 



Where do we get as we walk in the core graph f starting at x (resp. y) and 
following the path corresponding to ul One possibility is that the walk from x 
returns back to x and likewise for y. Alternatively this w-walk can take us from 
x to y and from y to x. We consider only the former possibility. The latter case 
would be handled by considering the walk corresponding to u 2 . Let p x ,Py G be 
words corresponding to some paths from the basepoint of F to x, y respectively. In 
particular, p x up~ l iVyWy 1 £ H. 




Merging x and y is equivalent to adding the generator p x Py to H, so that J = 
(H^pxPy 1 ). Since rk(J) = rk(H) + 1, we have that J = H * (PxPy 1 )- Consider the 
elements hi = p x upx l £ H and /i 2 = p y up~ x G H. The following equality holds: 

fll =P X UP~ 1 = (PxPy^PyUPy^PxPy 1 )' 1 = (PxPy 1 )h 2 (PxPy 1 y 1 
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This is a contradiction, since we obtained two different expressions for hi in the free 
product J = H * (pxPy 1 )- □ 

Remark 6.6. Let H < and x, y G V (Tx (H)). In the proof of the last claim it 
was shown that if there is some 1 / u G Ft which is readable as a closed path at 
both x and y, then the subgroup we obtain by merging them is not a free extension 
of H. We stress that the converse is not true. For example, consider H = (a, bob) < 
F ({a, b}). Then r.r a « (H) has three vertices, no pair of which share a common closed 
path (in other words, the corresponding graph T has no cycles). However, T{ a ,b\ {H) 
has exactly two immediate quotients, none of which is a free extension. 

Next we exhibit a one-to-one correspondence between Comp(T) and the imme- 
diate quotients of T. It is very suggestive to try and restore C from A(C) by simply 
signaling out the pairs of vertices that are identified in A(C). But this is too naive. 
There may be pairs of vertices not in C that are identified in A(C). For instance, 
consider C, the rightmost component of T in Figure ISTTl In A(C) we merge V\ and V3 
but also t> 3 and v 5. Thus V\ and v 5 are merged and likewise all pairs in the component 
of {v h v 5 }. 

However, simple group-theoretic arguments do yield this sought-after result: 

Claim 6.7. If tt (H) , <p(H) > rk(H), then there is a one-to-one correspondence be- 
tween Comp(T) and the set of immediate quotients ofT = Tx{H). 

Proof. Maintaining the above notation, we need to show that the mapping from 
C G Comp(T) to A(C), the immediate quotient generated by any of the pairs in C, 
is one to one. 

Denote by J the subgroup corresponding to the immediate quotient A(C). Let 
{x, y} be some vertex in C, and p x ,p y G F& words corresponding to some paths from 
the basepoint of T to x,y, respectively. Let also q = p x p~ x G F fe . As we saw above, 

J=(H,q) 

and clearly q ^ H. 

We claim that any other complementary generator of J over H is in same (H, H)- 
double-coset of q or of g" 1 in J. Namely, if J = (H, q') then q' G HqH U Hq~ x H. 
To see this, let Y be some basis of H and think of J as the free group over the basis 

Y U {q} (this is true because rk(J) = rk(H) + 1). Now think of q 1 as a word in 
the elements of this basis. Multiplying from the right or left by elements of Y does 
not affect the (H, iJ)-double-coset, so assume w.l.o.g. that q' begins and ends with 
either q or q^ 1 . But then the set YU{q'} is Nielsen-reduced with respect to the basis 

Y U {q} (see, for instance, the definition in Chapter 1 of [LS70]). As consequence, 



Y U {</} equals Y U {q} up to taking inverses (Proposition 2.8 therein). Thus q' = q 



or q' = q 1 . 



So let {a, b} be another pair of vertices generating A(C). We show that it belongs 
to C. Let PaiPb be words in F& corresponding to paths in T from the basepoint to 
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a, b respectively. We have (H,p a p b l ) = J, so p a p b 1 G HqH U Hq 1 H. W.l.o.g. it 
is in (otherwise switch a and 6). So assume PaP^ 1 — h\qh 2 with hi,h 2 G if. 

But h^pa is also a path from the basepoint of T to a, and likewise /i 2 p& a path to b. 
Choosing if needed these paths instead of p a ,Pb we can assume that 

PaPb 1 = <l = PxPy 1 - 

Thus, 

P~a l Px = Pb l Py 

This shows that there is a path in T from a to x corresponding to a path from b to 
y. This shows precisely that the pair {a, b} is in the same component of T as {x, y}, 
namely, in C. □ 

This completes the proof of Lemma [6731 This Lemma, together with Lemmas 16.21 
and 16. 1[ yields Proposition 11.91 and thus Theorem 11.51 



6.2 Further Relations between 7r( ) and 0( ) 

Let us take another look now at Conjecture 11.101 It posits that the results described 
in Proposition 11.91 hold for all values of tt(-) and </>(•). To understand what this 
means, suppose that if is a free factor in all the quotients in Ox{H) of ranks up 
to i — 1. What can be said about rank-z quotients in which if is a free factor? 
The conjecture states that their number exactly offsets the sum of two terms: The 
contribution to ai(H) of the quotients of smaller rank and of the term ^} H) when 
i = rk(H). For instance, ir(H) = 3 for H = (x^x^x^). In particular, if is a free 
factor of all quotients in Ox{H) of rank < 2. There is a single if-critical subgroup 
(F3 itself), and additional 13 quotients of rank 3 in which if is a free factor. The 
contribution of quotients of rank < 2 to 03 (if) is indeed exactly (—13). 

Interestingly enough, this is indeed the case for every free factor if < In this 
case, since free factors are measure preserving, we get that 0(ff) = 00, so Oj(ff) = 
for every i, and the statement of the previous paragraph holds. For the general case 
the conjecture states that as long as we consider low-rank quotients and "imprimi- 
tivity has not been revealed yet", the situation does not differ from what is seen in 
the primitive case. 

We finish this section by stating another result connecting tt(-) and </>(■). It 
shows an elegant property of both of them that lends further support to our belief 
in Conjecture 11.101 

Two words Wi,W2 G F& are called disjoint (with respect to a given basis) if they 
share no common letters. 

Lemma 6.8. Let u>i,u>2 G F& be disjoint. Then 

tt(w 1 w 2 ) = n(wi) + tt(w 2 ) 
4>{w 1 w 2 ) = 4>{wi) + 0(w 2 ) 
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Moreover, a^ mW2 ) (W1W2) = a<j>(v>i)(wi) ■ a(j>{w 2 ){ w 2) , an d if part 2 of Conjecture \1.10\ 
holds for H = (wi) and for H = (102), then it also holds for H = (W1W2) ■ 

This lemma is essentially outside the scope of the present paper, so we only sketch 
its proof. Let a n G Hom{Yk, S n ) be a random homomorphism chosen with uniform 
distribution. As Wi and u> 2 are disjoint, the random permutations a n (wi) and a n {w2) 
are independent. The claims about the additivity of <$(■) and the multiplicativity 
of are easy to derive by calculating the probability that 1 is a fixed point of 

W\W2- The key fact in this calculation is the aforementioned independence of a n (wi) 
and a n {w2). 

The other claims in the lemma follow from an analysis of if -critical subgroups. 
By considering properties of the associated core graphs it is not hard to show that 
J < F fc is (u^u^-critical iff it is the free product of a (u^-critical subgroup and a 
(u?2)- cr itical subgroup. 

7 Primitive Words and the Pro-finite Completion 

Most of the standard facts below about profinite groups and particularly free profinite 
groups can be found with proofs in [Wil98j (in particular Section 5.1). 

A profinite group is a topological group G with any of the following equivalent 
properties: 

• G is the inverse limit of an inverse system of finite groups. 

• G is compact, Hausdorff and totally disconnected. 

• G is isomorphic (as a topological group) to a closed subgroup of a Cartesian 
product of finite groups. 

• G is compact and f](N\N < G) = 1 

The free profinite group on a finite set A is a profinite group F together with a map 
j : X — > F with the following universal property: whenever £ : A — > G is a map 
to a profinite group G, there is a unique (continuous) homomorphism £ : F — > G 
such that £ = £j. Such F exists for every A and is unique up to a (continuous) 
isomorphism. We call j{X) a basis of F. It turns out that every two bases of F have 
the same size which is called the rank of F. The free profinite group of rank k is 
denoted by EV An element w G is primitive if it belongs to some basis. 

It is a standard fact that is the profinite completion of and F^ is naturally 
embedded in F&. Moreover, every basis of F^ is then also a basis for Fj., so a primitive 
word w G Ffc is also primitive as an element of F/.. It is conjectured that the converse 
also holds: 

Conjecture 7.1. A word w G F& is primitive in Fk iff it is primitive in F^. 
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This conjecture, if true, immediately implies the following one: 

Conjecture 7.2. The set of primitive elements in F k form a closed set in the profi- 
nite topology. 

Conjecture 11.41 implies these last two conjectures (it is in fact equivalent to Con- 
jecture 17.1^ see below): we define measure preserving elements in F& as before. 
Namely, an element w 6 Fj. is measure preserving if for any finite group G and a 
uniformly distributed random (continuous) homomorphism etc G Hom(Fk,G), the 
image &g{w) is uniformly distributed in G. Clearly, an element of F& is measure 
preserving w.r.t F& iff this holds w.r.t. F&. 

As in the abstract primitive element of F& is measure preserving. Con- 

jecture [T31 would therefore imply that if w G is primitive in F&, then w is also 
primitive w.r.t. F&. In particular, Theorem 11.51 yields: 

Corollary 7.3. Let S C F& be a finite subset of cardinality \S\ > k — 1. Then, 
S can be extended to a basis in F*. •<=>- S can be extended to a basis in F& 
In particular, for every w 6 F 2 : 

w is primitive in F 2 w is primitive in F 2 

This corollary yields, in turn, Corollary 11.61 which states the special case of 
Conjecture 17.21 for F 2 . 

As shown by Chen Meiri (unpublished), Conjectures 17.11 and 11.41 are equivalent. 
With his kind permission we explain this result in this section. Meiri showed that 
in Ffc primitivity and measure preservation are equivalent (Proposition 17.41 below). 
Thus, w G Ffc is primitive as an element of F^ iff it is measure preserving. 

Proposition 7.4. [G Meiri, unpublished] Let w belong to F&. Then 

w is primitive -<=>- w is measure preserving 

Proof. The (=^) implication is trivial as in the abstract case: for every finite group 
G and every basis x%, . . . , x k of there is a bijection 

Hom(F k ,G) 4 G k 
a G h)> (a G (xi), . . . ,aa(xk)) 

For the other direction, for every w G F k , finite group G and g G G define 

H w (G,g) = jct G G Hom(F k ,G) | a G (w) = 
E w (G,g) = |a G G Epi(F k , G) | a G {w) = 
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Now assume w G is measure preserving, and let x G F& be any primitive element. 
For every finite group G we have \H w (G,g)\ = |G| fe_1 = \H X (G, g)\. The same 
equality holds for the set of epimorphisms, namely \E w (G,g)\ = \E X (G, g)\. We will 
show this by induction on \G\. 

If \G\ = 1 the claim is trivial. The inductive step goes as follows: if g G G, then 

\E w (G,g)\ = \H w (G,g)\- £ \E w (H,g)\ = 

gdH<G 

= \H x (G,g)\- \E x (H,g)\ = \E x (G,g)\ 

geH<G 

Now choose a basis xi,...,Xk of F^. For every N <o F&, \E xl (Fk/N,wN)\ = 
\E w (F k /N,wN)\ > 1. If a G E Xl (F k /N,wN) then wN = a(xi),a(x 2 ), . . . ,a(x k ) 
generate F k /N. A standard compactness argument shows that there are elements 
w 2 , • • • , Wk G Ffc such that {wN, w 2 N, . . . , w^N} generate F^/N for every N <q F&. 
But then {w, w 2 , ■ ■ ■ ,Wk} generate F& as well. Whenever k elements generate F&, 
they generate it freely. Thus {w, w 2 , ■ ■ ■ , Wk} is a basis and w is primitive. □ 

8 The Average Number of Fixed Points in a n {w) 

As before, let a n G Hom(Fk, S n ) be a uniformly distributed random homomorphism. 
In ( 11. ip we defined the function ^/^(n) = for every io G Fjt. It considers the 

probability that ot n {w) fixes the element 1 and quantifies its deviation from -. The 
choice of the element 1 is arbitrary, of course, and we get the same probability for 
every element in 1, . . . , n. Thus n§ w (n) + 1 is the average number of fixed points of 
the random permutation a n (w). 

Corollary 14.21 states that in F 2 there are exactly four possible primitivity ranks of 
words. This translates through Proposition 11.91 to four possibilities for the average 
number of fixed points in the permutation a n (w), as summarized by Tabled) 



7r(w) /<f)(w) 


Description 


Prob[a n (w){l) = 1] 


Avg # of f.p. of a n (w) 





w = 1 


1 


n 


1 


w is a power 


n ' n 1 2-^i= 2 n i 


l + ai («,) + 0(I) 


2 




1 | CL2{w) , v^°° <h(w) 


i + a2(w) + o(\) 


oo 


w is primitive 


n 


1 



Table 1: The possibilities for the average number of fixed points of the permutation 
a n (w) for some w G F 2 . 



Recall that all coefficients a,i(w) are integers (Claim loTTT) . Moreover, in these cases 
a<f>(w){w) counts the (w)-critical subgroups of F2, so in particular a^ w \(w) > 0. We 
thus obtain 
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Corollary 8.1. For every word to6F 2 and every large enough n, the average number 
of fixed points of a n (w) is at least 1. 

This leads to the following conjecture, which is a consequence of Conjecture II. 101 

Conjecture 8.2. For every word w G and every large enough n, the average 
number of fixed points of a n (w) is at least 1. 

Proposition 11.91 says something about free words in general. If <p(w) < 2 for some 
w G Ffc, then the first non- vanishing coefficient a^( w )(w) is positive. Thus, 

Corollary 8.3. For every word w G the average number of fixed points in a n (w) 
is at least 1 — O(^) . 

It is suggestive to ask whether Conjecture 18.21 holds for all n. Namely, is it true 
that for every u> G F fc and every n, the average number of fixed points in a n (w) is 
at least 1? By results of Abert ( |Abe06| ). this statement turns out to be incorrect. 

A Note Added in Proof 

Remark 8.4. After this paper was completed, we learned about the algorithm of 
Silva and Weil to detect free-factor subgroups in the free group [SW08]. In essence, 
their algorithm relies on the same phenomenon that we independently noticed here. 
However, our reasoning is very different, and offers several substantial advantages 
over the presentation in [SW08J. A more elaborate discussion of the differences 
between the two approaches appears in Appendix |A] 

Remark 8.5. In subsequent joint work with O. Parzanchevski [PP12], we manage 
to prove Conjecture 11.41 in full. That proof relies on Theorem 11.11 and follows the 
general strategy laid out in the current paper. In particular, we establish Conjectures 
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Appendices 



A An Algorithm to Detect Free Factors 

One of the interesting usages of Theorem 11.11 is an algorithm to detect free factor 
subgroups and consequently, also to detect primitive words in Ffc. The algorithm 
receives as input H and J, two finitely generated subgroups of Ffc, and determines 

whether H < J. The subgroups H and J are given to us by specifying a generating 
set, where members of the generating sets are words in the elements of the fixed basis 
X. (Note that the algorithm in particular decides as well whether H < J, but this 
is neither hard nor new). 

We should mention that ours is not the first algorithm, nor the first graph- 
theoretic one, for this problem (see Chapter 1.2 in |LS70j ). We already mentioned 
(Remark I8.4[) [SW08J , who noticed the basic phenomenon underlying our algorithm, 
albeit in a very different language. See Remark IA.2I below for an explanation of 
the differences. A well-known algorithm due to Whitehead solves a much more gen- 
eral problem. Namely, for given 2r words Wi, . . . , w r , Ui, . . . , u r G Ffc, it determines 
whether there is an automorphism a G Aut(Fk) such that a(w{) = Ui for each i 
QWhi36aJ,[Whi36bJ. For a good survey see Chapter 1.4 at [LS70] . A nice presenta- 
tion of the restriction of Whitehead's algorithm to our problem appears in [Sta99j). 
Quite recently, Roig, Ventura and Weil introduced a more clever version of the White- 
head algorithm for the case of detecting primitive words and free factor subgroups 
[RVW07J. Their version of the algorithm has polynomial time in both the length of 
the given word w (or the total length of generators of a given subgroup H) and in k, 
the rank of the ambient group Ffc. To the best of our knowledge, their algorithm is 
currently the best one for this problem, complexity- wise. The algorithm we present 
is, at least naively, exponential, as we show below (Remark IA. 1|) . 

So assume we are given two subgroups of finite rank of Ffc, H and J, by means 
of finite generating sets Sh, Sj. Each element of Sjj, Sj is assumed to be a word in 
the letters X U X^ 1 (recall that X = {xi, . . . , Xk} is the given basis of Ffc). To find 

out whether H < J, follow the following steps. 

Step 1: Construct Core Graphs and Morphism 

First, construct the core graphs T = Tx{H) and A = Tx{J) by the process described 
in Appendix O Then, seek a morphism rj : F — > A. This is a simple process that 
can be done inductively as follows: i] must map the basepoint of T to the basepoint 
of A. Now, as long as r] is not fully defined, there is some j-edge e = (u, v) in E(T) 
for which the image is not known yet, but the image of one of the end points, say 
rj(u), is known (recall that T is connected). There is at most one possible value that 
77(e) can take, since the star of i](u) contains at most one outgoing j-edge. If there 
is no such edge, we get stuck. Likewise, rj(v) must equal the terminus of 77(e), and 
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if 7](v) was already determined in an inconsistent way, we get a contradiction. If in 
this process we never get stuck and never reach a contradiction, then 77 is defined. 
If this process cannot be carried out, then there is no morphism from T to A, and 
hence H is not a subgroup of J (see Claim |2T2"|) . 

x 

Step 2: Reduce to Two Groups with H -» J 

After constructing the morphism 7] : T — > A, we obtain a new graph from A by 
omitting all edges and all vertices not in the image of rj. Namely, 

A' := r,(T) 

It is easy to see that A' is a core-graph, and we denote by J' the subgroup corre- 

sponding to A'. Obviously, A' is a quotient of r, so H -» J'. Moreover, it follows 
from Claim 1231 that 

* * 
H <J H < J'. 

* 

Step 3: Use px{H, J') to determine whether H < J' 

Now calculate px(H, J') (this is clearly doable because the subgraph of P*. consisting 
of quotients of T is finite). Thanks to Theorem px{H, J') determines whether 

or not H < J', and consequently, whether or not H < J. 



Remark A.l. The complexity of this algorithm is roughly 0(v 2t ), where v is the 
number of vertices in Tx{H) and t is the difference in ranks: t = rk(J) — rk(H). 
Naively, we need to go over roughly all possible sets of t pairs of vertices of Tx{H) 
and try to merge them (see Remark 13. 7j) . The number of possibilities is at most 

(y), which shows the claimed bound. (In fact, we can restrict to pairs where both 
vertices are in the same fiber of the morphism rj : Tx{H) — > Tx{J)-) 

A.l Examples 

We illustrate the different phases of the algorithm by two concrete examples. Con- 
sider first the groups H = (xiX2X-^ 1 x 2 ~ 1 , X2X 2 ) and J = (x±, x 2 3 , XiX^ 1 , £1X2X1), 
both in F 2 . The core graphs of these groups are: 
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In this morphism r\ from T = Tx{H) to A = Fx(J) can be constructed. 

All the vertices of T are in the image of 77, and only one edge, the long 2-edge at the 
bottom, is not in r](E(T)). Thus A' is: 




and J' is the corresponding subgroup J' = (x^ , XiX 2 x , X\X 2 X\) . 

Finally, rk(H) = 1 — = 2 and rk(J') = 1 — x(A') = 3, and so the difference 
is rk(J') — rk(H) = 1. It can be easily verified that A' is indeed an immediate 
quotient of T: simply merge the upper-right vertex of T with the bottom-left one to 

obtain A'. Thus p x (H, J') = 1 = rk(J') - rk(H), and so H < J' hence H < J. 

As a second example, consider the commutator word w = X\X 2 x^ x x 2 x . We want 
to determine whether it is primitive in F3. We take H = (w) and the core graphs 
are then 




3 



1 (^r >^> . 



Once again, a morphism 77 from T = Tx(H) to A = IV (F3) can be constructed, 
and there is a single edge in A, the 3-edge, outside the image of 77. Thus A' is the 
quotient of T which is the bottom graph in Figure l3~7fl and J' is simply F2. 

Finally, rk(H) = 1 — x(T) = 1 and rk(F 2 ) = 2, and so the difference is rk(F 2 ) — 
rk(H) = 1. But as we infer from Figure EHJ p x (H,F 2 ) = 2. Thus p x (H,F 2 ) > 
rk(F 2 ) —rk(H) and H is not a free factor of F 2 . As consequence, w is not primitive 
in F3. (This example generalizes as follows: if w is a free word containing exactly / 
different letters, then w is primitive iff we can obtain a wedge-of-loops graph from 
Fx((w)) by merging / — 1 pairs of vertices.) 

Remark A. 2. At this point we would like to elaborate on the differences between 
the algorithm presented here and the one introduced in [SW08J. Silva and Weil's pre- 
sentation considers automata and their languages. We consider the A- fringe O x {H) 
and introduce the DAG and the distance function from Definition 13.21 Steps 
1 and 2 of our algorithm, which reduce the problem in its very beginning to the 
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case where H -» J, have no parallel in [SW08J. Rather, they show that if H < J, 
then by some sequence of "i-steps" (their parallel of our immediate quotients) on 
H, of length at most rk(J) — rk(H), one can obtain a core graph which is em- 
bedded in Tx{J) (we make the observation that this embedded core graph can be 
computed in advance). Besides shedding more light on this underlying phenomenon, 
our more graph-theoretic approach has another substantial advantage: by consider- 
ing "Dfc, turning the fringe Ox{H) into a directed graph and stating the algorithm 
in the language of Theorem II. 1[ we obtain a straight-forward algorithm to identify 
iY-critical subgroups and to compute tt{H). Moreover, we obtain a straight-forward 
algorithm to identify all "algebraic extensions" of H ( Corollary 14. 4p . In particular, our 
algorithm to identify algebraic extensions substantially improves the one suggested 
in |KM02| . Theorem 11.3 (and see also remark 11.4 about its efficiency). 



B The Proof of Lemma 13.3 

To complete the picture, we prove the upper bound for px{H, J) stated in Lemma 
13.31 We need to show that if H, J < f g such that H -» J, then 

Px(H,J) < rk(J) 

Proof. We show that A = Tx{J) can be obtained from T = Tx{H) by merging at 
most rk( J) pairs of vertices. To see this, denote by m the number of edges in T, and 
choose some order on these edges, ex, . . . , e m so that for every i, there is a path from 
the basepoint of T to traversing only edges among e±, . . . , ej_i. (So e± must be 
incident with the basepoint, e<i must be incident either with the basepoint or with 
the other end of e\, etc.) 

We now expose A step by step, each time adding the images of the next edge 
of r and of its end points. Formally, denote by rj the (surjective) morphism from T 
to A, let Tj be the subgraph of T that is the union of the basepoint of T together 
with e\, . . . , ej and their endpoints, and let Aj = 77(1^). We thus have two series of 
subgraphs 

r c r\ c . . . c r m = r 

and 

A C Ai C . . . C A m = A 

with A = T being graphs with a single vertex and no edges. 

Assume that ej = (u,v), and w.l.o.g. that u G V(Ti_i). We then distinguish 
between three options. A forced step is when rj(ei) already belongs to Aj_i and 
then A, = Aj_i. A free step takes place when neither r/(ej) nor 77(f) belong to A»_i, 
in which case 7Ti(Aj) = 7Ti(Aj_i). The third option is that of a coincidence. This 
means that ri(ej) does not belong to Aj_ x but rj(v) does. In this case, A, is obtained 
from Aj_! by connecting two vertices by a new edge, and ^(Aj) has rank larger by 
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1 from the rank of 7Ti(Aj_i). Since the fundamental group of Ao has rank 0, this 
shows there are exactly rk(J) coincidences along this process. 

Assume the coincidences occurred in steps ji, . . . , j' r k(j)- If = (u,v), we let 
v G ?7 _1 (?7(f )) PI V(Ti-i), and take {v , u} to be a pair of vertices of T that we merge. 
(It is possible that v = w.) Let Wi G be be a word corresponding to this merge in 
T. It is easy to see by induction that A^ corresponds to the subgroup (H, wi, . . . , w^. 
In particular, A corresponds to (H,W\, . . . , u> r k(j)). We are done because all these 
words correspond to pairs of vertices in T (and see Remark \3.7\i . □ 

C The Folding Algorithm to Construct Core Graphs 



Finally, we present a well known algorithm to construct the core graph of a given 
subgroup H <f g Ffc. The input to this process is any finite set of words {hi, . . . , h r } 
in the letters {xi, . . . , x^} that generate H. 




2 



Figure C.I: Generating the core graph Tx(H) of H = (xiX2Xi 3 , x 1 2 X2X 1 ' 2 ) < F2 
from the given generating set. We start with the upper left graph which contains a 
distinct loop at the basepoint for each (reduced) element of the generating set. Then, 
gradually and at arbitrary order, we merge pairs of equally-labeled edges which share 
the same origin or the same terminus. One of the possible orders of merging pairs is 
shown in this figure, and at each phase we mark by triple arrows the pair of edges 
being merged. The graph in the bottom right is Tx{H), as it has no equally-labeled 
edges sharing the same origin or the same terminus. 

Every element hi of the generating set corresponds to some path with directed 
edges labeled by the x^s (we assume the element is given in reduced form). Merge 
these r paths to a single graph by identifying all their 2r end-points to a single vertex 
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which is denoted as basepoint. Then, as long as there are two j-labeled edges with 
the same terminus (resp. origin) for some j, merge the two edges and their origins 
(resp. termini). Such a step is often referred to as a Stallings' folding. It is a fairly 
easy observation that the resulting graph is indeed Tx{H) and that the order of 
folding has no significance. To illustrate, we draw in Figure IC.ll the folding process 
by which we obtain the core graph Tx(H) of H = (xiX2X^ 3 , x^X2X^ 2 ) < F 2 from 
the given generating set. 
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